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The act of measurement bridges the quantum and 
classical worlds by projecting a superposition of possi- 
ble states into a single, albeit probabilistic, outcome. 
The time-scale of this "instantaneous" process can be 
stretched using weak measurements 1,2 so that it takes 
the form of a gradual random walk towards a final 
state. Remarkably, the interim measurement record is 
sufficient to continuously track and steer the quantum 
state using feedback 3-8 . Here, we report the first im- 
plementation of quantum feedback control in a solid 
state system, in our case a superconducting quantum 
bit (qubit) coupled to a microwave cavity 9 . Probing the 
state of the cavity with less than one photon on aver- 
age, implements a weak measurement of the qubit state. 
These photons are then directed to a high-bandwidth 
quantum-noise-limited amplifier 10 ' 11 , which enables real- 
time monitoring of the state of the cavity — and hence 
that of the qubit — with high fidelity. We demonstrate 
quantum feedback control by inhibiting the decay of 
Rabi oscillations, allowing them to persist indefinitely 12 . 
This new ability permits active suppression of decohcr- 
ence and defines a path for continuous quantum er- 
ror correction 13 ' 14 . Other novel avenues include quan- 
tum state stabilization 4 ' 7 ' 15 , entanglement generation us- 
ing measurement 16 , state purification 17 , and adaptive 
measurements 18 ' 19 . 

Feedback protocols in classical systems, from anti- 
lock brakes to pacemakers, use the outcome of a mea- 
surement to stabilize the system about a desired state. 
The operation of such feedback protocols is predicated 
on the idea that measurement does not itself alter the 
state of the system. This is no longer true in quantum 
mechanics where measurement is necessarily invasive 1 . 
In the Copenhagen interpretation, a quantum object 
can exist simultaneously in more than one stable state 
or eigenstate until observed — Schrodinger's celebrated 
"dead and alive" cat being the quintessential hypotheti- 
cal example 20 . A sense of reality is restored by the act of 
measurement which forces the system "instantaneously" 
into one of its eigenstates in a probabilistic fashion (the 
so-called measurement back-action). In light of this ef- 
fect, how does one apply measurement-based feedback to 
a quantum system such as a qubit? 

One solution is to use weak measurements 1 ' 2 where we 
deliberately limit the rate (r meas ) at which information 
is extracted, thereby slowing down the qubit 's random 
walk towards an eigenstate. Integral to this scheme is a 
detector with efficiency ?7det = r moas /r v — > 1, where T v 



is the ensemble averaged dephasing rate due to measure- 
ment back-action 21 . The high detector efficiency allows 
us to faithfully track the qubit continuously, and steer it 
to a desired state using real-time feedback. 

The experimental setup is shown in Fig. 1. Our 
quantum system (Fig. lb) is an anharmonic oscilla- 
tor realized by a capacitively-shunted Josephson junc- 
tion, dispcrsivcly coupled to a 3D microwave cavity 22 . 
We use its two lowest energy levels (woi/27r = 5.4853 
GHz) to form a qubit (transmon 23 ). The cavity res- 
onant frequency with the qubit in the ground state is 
oj c /2w = 7.2756 GHz. The strongly coupled output 
port sets the cavity linewidth k/2it = 13.4 MHz while 
control and measurement signals are injected via the 
weakly coupled input port (Figs, la and lb). The qubit- 
cavity coupling results in a state-dependent phase shift 
[A0 = 2 tan -1 {2 X /k) = 12°, x/2tt = 0.687 MHz] of the 
cavity output field 9,24 , with the state information con- 
tained in one quadrature of the signal. The cavity output 
is sent to a near-noiseless (%et ~ 1) phase-sensitive para- 
metric amplifier (paramp) 10 ' 11 which boosts the relevant 
quadrature to a level compatible with classical circuitry. 
The paramp output is further amplified and homodyne 
detected (Fig. lc) such that the amplified quadrature 
(Q) contains the final measurement signal. 

We acquire Rabi oscillations with the cavity continu- 
ously excited at cj r /27r = 7.2749 GHz (w r « uj c — x) with 
a mean cavity photon occupation n which controls the 
measurement strength (see section II of supplementary 
information for calibration of n ). The Rabi drive at the 
ac Stark shifted 25 qubit frequency (woi — 2xn) is turned 
on for a fixed duration r m . The amplitude is adjusted 
to yield a Rabi frequency VLr/2h — 3 MHz. First, we 
average 10 4 measurement traces to obtain a conventional 
ensemble- averaged Rabi oscillation trace (Fig. 2a). Even 
though the qubit is continuously oscillating between its 
ground and excited states, the oscillation phase diffuses 
primarily due to measurement back-action. As a result, 
the averaged oscillation amplitude decays over time. The 
persistent nature of these oscillations, however, is evi- 
dent in the frequency domain response 26 . We Fourier 
transform the individual measurement traces and plot 
the averaged spectrum in Fig. 2b (blue trace). A peak 
centered at 3 MHz with a width V is observed and re- 
mains unchanged even when r m is much longer than the 
decay time of the ensemble averaged oscillations. A plot 
of the peak width T for different measurement strengths 
(in units of n) is shown in Fig. 2c. As expected in the 
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dispersive regime, T and n are linearly related 25 . The ver- 
tical offset is dominated by pure enviromental dephasing 
r onv but has contributions from qubit relaxation (Ti ) and 
thermal excitation into higher qubit levels; more details 
can be found in sections II and IV(C) of the supplemen- 
tary information. 

The ratio of the height of the Rabi spectral peak to 
the height of the noise floor has a theoretical maximum 
value of four 27 , corresponding to an ideal measurement 
with overall efficiency rj = 1. For our setup, this efficiency 
can be separated into two contributions as 77 = r/dct Venv 
The detector efficiency is given by ?7dot = [1 + 2n a dd] _1 
with n a dd being the number of noise photons added by 
the amplification chain. The added noise is referenced to 
the output of the cavity and includes the effect of signal 
attenuation between the cavity and the amplifier. The 
effect of environmental dephasing r onv is modeled using 
Vcnv — (1+Tcnv/r^) ■ The best measurement efficiency 
we obtain experimentally is rj = 0.40, with ?7dct = 0.46, 
and Tfenv = 0.87; more details can be found in section III 
of supplementary information. 

We now discuss the feedback protocol which can be 
understood as a phase-locked loop stabilizing a quantum 
oscillator. The amplified quadrature (Q) is multiplied by 
a Rabi reference signal with frequency £Io/2tt = 3 MHz 
using an analog multiplier (Fig. Id) . The output of this 
multiplier is low-pass filtered and yields a signal propor- 
tional to the sine of the phase difference 9 CII between the 
3 MHz reference and the 3 MHz component of the ampli- 
fied quadrature. This "phase error" signal is fed back to 
control the Rabi frequency by modulating the Rabi 
drive strength with an upconverting IQ mixer (Fig. la). 
The amplitude of the reference signal controls the dimen- 
sionlcss gain F given by fifb/^R, = — Fsin(# crr ) where 
Ofb is the change in Rabi frequency due to feedback. Fig. 
2d shows a feedback-stabilized oscillation which persists 
for much longer than the original oscillation in Fig. 2a. 
In fact, within the limits imposed by our maximum data 
acquisition time of 20 ms, these oscillations persist indef- 
initely. The red trace in Fig. 2b shows the corresponding 
averaged spectra. The needle-like peak at 3 MHz is the 
signature of the stabilized Rabi oscillations. 

To confirm the quantum nature of the feedback- 
stabilized oscillations, we perform state tomography on 
the qubit 28 . We stabilize the dynamical qubit state, stop 
the feedback and Rabi driving after a fixed time (80 /_ts 
+ T tomo after starting the Rabi drive) , and then measure 
the projection of the quantum state along one of three 
orthogonal axes. This is done by using strong measure- 
ments (by increasing n) with single-shot fidelity 11 ' 29,30 
which allow us to remove any data points where the qubit 
is found in the second excited state (see section IV(C) 
of supplementary information). By repeating this many 
times, we can determine (crx), (<ry), and {<Jz), the three 
components of the Bloch vector for the ensemble qubit 
state. Fig. 3a shows a plot of the Bloch vector compo- 
nents for different time points (T tomo ) over one oscillation 
period [l/(27rfi )]. The Y and Z components are well fit 



by a sinusoidal function, whereas the X component is 
nearly zero as expected for a coherent Rabi oscillation 
about the X axis. The efficiency of the feedback process 
is reflected in the non-unit amplitude of these oscillations. 
This feedback efficiency D is given by the time-averaged 
scalar product of the desired and actual state vectors on 
the Bloch sphere (see section IV(A) of supplementary in- 
formation). In our experiment, the measurement is weak 
enough that the stabilized oscillations are sinusoidal and 
D is approximately equal to the amplitude of these os- 
cillations. 

In Fig. 3c we plot D (red squares) versus the dimcn- 
sionless feedback strength F. We find a maximum value 
of D = 0.45 for the optimal choice of F. The dashed 
black line is a plot of the theoretical expression for D 
given by 



■n v/q r f 

which is derived using a simple analytical theory based on 
the Bayesian formalism for the qubit state trajectory (see 
section IV(A) of supplementary information) but docs 
not account for finite feedback bandwidth, loop delays 
in the circuit or qubit relaxation. The maximum value 
= y/fj is obtained for an optimal feedback strength 
F opt = y/fj T/Q R . A value of D max < 1 implies that the 
stabilized state is a mixed state; this occurs for r\ < 1, 
implying that we have incomplete information about the 
qubit state. Another way to visualize the stabilized state 
with D < 1 is shown in Fig. 3b. The Bloch vector during 
a single measurement is roughly within a certain angle of 
the desired state resulting in the averaged Bloch vector 
(red arrow) having a magnitude smaller than one. In 
principle, it is possible to approach a pure state with 
D = 1 by ensuring 77 = 1 and minimizing feedback 
loop delay. To account for the finite loop delay (250 
ns), feedback bandwidth (10 MHz) and qubit relaxation 
(7\ = 20 (is), we performed full numerical simulations of 
the Bayesian equations for qubit evolution (see section 
IV (B) of supplementary information). The results are 
shown as a black solid line in Fig. 3b and agree well with 
our experimental data. 

In conclusion, we have demonstrated a continuous ana- 
log feedback scheme to stabilize Rabi oscillations in a 
superconducting qubit, enabling them to persist indefi- 
nitely. The efficiency of the feedback is limited primarily 
by signal attenuation and loop delay, and can be im- 
proved in the near future with the development of on-chip 
paramps and cryogenic electronics, respectively. We an- 
ticipate that our present technology, with minimal mod- 
ifications, is sufficient to implement continuous quantum 
error correction of a logical qubit encoded in multiple 
physical qubits using pairwise parity measurements 13 ' 14 . 
This development will also usher in a new era of mea- 
surement based quantum control for solid-state quantum 
information processing 4, 7,15-19 . 
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FIG. 1: Experimental setup, (a) shows the signal generation setup. One generator provides the Rabi drive at the 
ac Stark shifted qubit frequency (ojoi — 2xn), while the output of another generator at 7.2749 GHz is split to create the 
measurement signal, paramp drive and local oscillator. The relative amplitudes and phases of these three signals are controlled 
by variable attenuators and phase shifters (not shown), (b) shows a simplified version of the cryogenic part of the experiment; 
all components are at 30 mK (except for the HEMT amplifier, which is at 4 K). The combined qubit and measurement 
signals enter the weakly coupled cavity port, interact with the qubit, and leave from the strongly coupled port. The output 
passes through two isolators (which protect the qubit from the strong paramp drive), is amplified, and then continues to the 
demodulation setup. The coherent state at the output of the cavity for the ground and excited states is shown schematically 
before and after parametric amplification, (c) The amplified signal is homodyne detected and the two quadratures are digitized. 
The amplified quadrature (Q) is split off and sent to the feedback circuit (d), where it is multiplied with the Rabi reference 
signal. The product is low-pass filtered and fed back to the IQ mixer in (a) to modulate the Rabi drive amplitude. 
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FIG. 2: Rabi oscillations and feedback, (a) shows conventional ensemble-averaged Rabi oscillations measured using weak 
continuous measurement, which decay in time due to ensemble dephasing. In (b), the individual measurement traces from 
(a) are Fourier transformed before averaging. The averaged spectrum shows a peak at the Rabi frequency (blue trace) with 
a width r/27r (FWHH). The grey trace shows an identically prepared spectrum for the squeezed quadrature (multiplied by 20 
for clarity), which contains no qubit state information. V is plotted as a function of cavity photon occupation n (measurement 
strength) in (c), showing the expected linear dependence, (d) shows feedback-stabilized ensemble averaged Rabi oscillations, 
which persist for much longer times than those without feedback seen in (a). The corresponding spectrum, shown in (b), has 
a needle-like peak at the Rabi reference frequency (red trace). The slowly changing mean level in the Rabi oscillation traces 
in (a) and (d) is due to the thermal transfer of population into the second excited state of the qubit. See section IV(C) of 
supplementary information for more details. 




FIG. 3: Tomography and feedback efficiency, (a) shows quantum state tomography of the feedback-stabilized state. We 
plot (ox) , (oy) an d (pz) for different time points T tomo in one full Rabi oscillation of the qubit. The solid lines are sinusoidal 
fits. The magnitude of these sinusoidal oscillations is approximately equal to the feedback efficiency D = 0.45. (b) shows a 
visualization of the feedback stabilized Bloch vector for D < 1 . The Bloch vector during a single measurement remains roughly 
within a certain angle (shaded pie region) of the desired state; higher the efficiency D, smaller the angle. Tomography of the 
stabilized state measures the average over many iterations which reduces the length of the averaged Bloch vector (red arrow). 
In (c), we plot D as a function of the dimensionless feedback strength F. Solid red squares are experimental data with a 
maximum value of D = 0.45. The dashed black line is a plot of Eq. (1) with 77 = 0.40 and T/27r = 0.154 MHz (n = 0.47, 
F nv/27r = 0.020 MHz), while the solid black line is obtained from full numerical simulations of the Bayesian equations including 
finite loop delay (250 ns) and feedback bandwidth (10 MHz). 
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I. DEVICE FABRICATION AND PARAMETERS 

The transmon qubit was fabricated on a bare high-resistivity Si wafer using electron beam lithography and double- 
angle Aluminum evaporation with an intervening oxidation step. The qubit is a single Josephson junction with two 
rectangular paddles (425 /im x 225 /zm) which provide the shunting capacitance and coupling to the cavity. The cavity 
was machined out of Aluminum (6061 alloy). The quality factor of the cavity was adjusted by controlling the length 
of the center conductor of the SMA coaxial connector protruding into the cavity volume. This was done for the 
output port to provide strong coupling while the input port was coupled weakly to provide a net power transmission 
at resonance of -20 to -30 dB. 

The Josephson (Ej) and charging energy (Ec) of the transmon qubit were determined by qubit spectroscopy which 
yielded transition frequencies cj i/27t = 5.4853 GHz and u>o 2 /2it = 10.7382 GHz. We then calculated Ej = 19.274 
GHz and Eq = 0.211 GHz giving Ej/E c = 91. The qubit relaxation time was measured to be Ti = 20 fxs while echo 
experiments yielded T 2 * = 8 [is. 
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II. DISPERSIVE SHIFT AND PHOTON NUMBER CALIBRATION 

In order to determine the dispersive shift 2% of the cavity between the qubit's ground and excited states, we use 
a combination of ac stark shift (Aw ac = 2\n) and measurement induced dcphasing of the qubit (T v = 8x 2 n//-t), 
where n is the average photon occupation of the readout cavity 1 . To measure these quantities precisely, we perform 
a Ramsey fringe experiment where the free evolution period between the two 7r/2 pulses is modified by exciting the 
cavity with a fixed power P at the readout frequency u> T . By fitting the Ramsey fringes to an exponentially decaying 
sinusoidal function, we measure Aw ac by extracting the Ramsey frequency and the dcphasing rate (rR amscy = T^+T^) 
by extracting the decay constant. Here T 2 * = l/Y* 2 is the decay constant of the Ramsey fringes in the absence of 
ly^ any photons in the cavity. This technique is significantly faster than conventional spectroscopy 1 and provides better 
precision in extracting Aix> ac and Y v . We repeat this process for different P and since n cx P, a plot of Aa; ac vs P and 
TRamsey vs P gives two straight lines with slopes m ac and m v (Fig. SI). The ratio m v jm ac = then allows us to 
determine the dispersive shift 2% = 1.375 MHz. We use this value of 2\ and the stark shift data to get a calibration 
j>. for the average photon number n in the cavity. 

S3 III. DETECTOR AND MEASUREMENT EFFICIENCY 



In our experiment, the overall measurement efficiency relevant for feedback is given by r\ — r/dct lew The first 
term ?7 dct accounts for the noise added by the amplification chain. To measure the noise added by the amplifier, 
one typically uses a calibrated noise source. Instead, we use the qubit+cavity system as a calibrated signal source. 
As discussed in the previous section, we can excite the cavity with a precise average photon occupation n. This 
corresponds to a power radiated from the cavity P ra d = haj r n where uj r is the frequency of excitation. We send this 
signal to the paramp and measure the signal to noise ratio (SNR) at the output. This allows us to extract the noise 
floor of the amplification chain which includes a dominant contribution from the paramp and a smaller contribution 
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from the HEMT amplifier. 

The noise floor referred to the input of amplification chain is given by P n = hw r B / r/det where B is the integration 
bandwidth (in Hz). We calculate the detector efficiency as 

SNR B 

met = — — • (1) 
n K 

We repeat this experiment for a range of frequencies within the paramp bandwidth and extract an average detector 
efficiency r\dct = 0.46. Note that the actual power reaching the paramp is smaller due to signal attenuation between 
the cavity and the paramp. This attenuation reduces the overall efficiency and our measurement technique correctly 
measures the efficiency of the amplification chain starting from the output of the cavity. With the help of cryogenic 
switches we independently measure the signal attenuation to be roughly 2.5 dB implying that this mechanism is the 
dominant mechanism for reduction in detector efficiency. Future experiments will endeavor to minimize this signal 
attenuation. 

The final contribution to the overall measurement efficiency is due to environmental decoherence via pure dephasing. 
The efficiency ?7 env = (f + T^/T^)^ 1 characterizes how much of the total dephasing is due to measurement. In 
principle one can keep increasing T v by increasing n to improve this efficiency but there arc two practical constraints. 
The measurement should be weak enough such that it is not projective on the timescale of the Rabi period fi^ 1 thus 
leaving the qubit evolution oscillatory. Furthermore, the feedback bandwidth required increases with T v . Since the 
effective feedback bandwidth is fixed by the measurement chain, the feedback efficiency D decreases with increasing 
measurement strength. The data shown in Fig. 2 and Fig. 3 corresponds to an optimal choice of T v = 8x 2 n/n = 0.134 
MHz to give the maximum value of D. Now dephasing due to low frequency noise does not affect the feedback efficiency 
because the system can track any slow variations in the qubit frequency (and consequently in the Rabi frequency). 
Hence, we set r cnv = 1/T£ measured from echo experiments giving us r env /27r = 0.02 MHz, and ?7 onv = 0.87. Note 
that this definition includes the dephasing contribution from qubit relaxation (Ti/2). 



IV. BAYESIAN FORMALISM AND FEEDBACK EFFICIENCY 



In this section, we will briefly describe the "quantum Bayesian" formalism 2 , which, broadly speaking, is similar 
to the "quantum trajectory" theory 3-5 . The assumptions needed for applicability of the formalism to a circuit QED 
setup 6 (dispersive interaction, "bad cavity" regime, and weak response) are well satisfied in our experiment. Since 
we use a phase-sensitive detector to amplify the optimal signal quadrature, there is essentially no back-action on the 
qubit from photon number fluctuations in the resonator 6 , and thus the present case is identical to that of a qubit 
measured by a quantum point contact. Note that we will be using the symbol I(t) to describe the measurement 
output signal to be consistent with reference 2 . This should not be confused with the T quadrature of the mixed down 
signal discussed in the main text where the measurement output is the 'Q' quadrature. 

We will first consider the case where the detector is ideal (r^ct = !)• The qubit evolution during the process of 
continuous measurement can be described using stochastic equations 2 for the qubit density matrix p. For a resonant 
Rabi drive, the equations (in Stratonovich form) are given by 

Pn = -Poa = -^Rlmpoi + PuPoo^-[I(t) - ^ t ~ ] ~ r iJ°n ( 2 ) 

bid 2 



Poi = (pn - Poo) - 5^01 (pn - Poo) [I(t) - ° 2 - ] - (r c „ v + y )Pi2 (3) 

where Or, is the Rabi frequency, r onv is the environmental dephasing rate, Ti = 1/T\ is the qubit relaxation rate. 
The measurement output signal I(t) is given by 

^) = ^r^ + ^[pii-poo]+6dM w 

where Iq and I\ are the average output signals for the qubit in ground and excited state respectively, AI = I x — I is 
the response and £;d is the white noise of an ideal detector characterized by the (one-sided) spectral density Sid- Note 
that for solving these equations, the choice of Jo, h and is somewhat arbitrary and it is the ratio of (AT) 2 and 
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Sid that matters. The strength of the measurement which is characterized by the measurement induced dephasing 
rate is given by 



We use two slightly different methods to account for measurement efficiency 77 when solving equations (2), (3) and 
(4) analytically (section IV A) or numerically (section IV B). 



A. Analytical derivation of feedback efficiency 



We now derive an analytical expression for the feedback efficiency D, neglecting delays in the feedback loop and 
the effect of finite bandwidth in the resonator, amplifier, and feedback circuit. We also assume weak coupling, and no 
qubit energy relaxation (ri = 0). We do, however, take into account arbitrary measurement efficiency rj. The total 
dephasing rate is given by T = T v + T cnv . 

To obtain a closed form expression, it is possible to reduce the number of qubit degrees of freedom down to only 
one. First, on account of a resonant Rabi drive which rotates the qubit about the x-axis, the process of measurement 
eventually attracts the qubit to the x = plane. Second, in the absence of energy relaxation, we can consider the 
qubit state as pure, ascribing any measurement inefficiency (77 < 1) to the additional noise at the detector output 7 . 
Note that 77 includes the effect of environmental dephasing r onv (via 77 env ) as well as detector efficiency rydct- So we 
set r env = in equation (3) and model both environmental dephasing and detector inefficiency by adding the noise 
term £ a dd(i) to equation (4) and we get 

I{t) = + ^L [pil poo ] + 6d (t) + £add(t) (6) 

where the added noise £ a dd(i) has a spectral density S a dd = S out — Sid, where Sid = (A/) 2 /(4r) and S ou t = Sid/>7 
is the total output noise. Therefore, the qubit state evolution can be described by only one parameter, the polar 
(zenith) angle 6(t) on the Bloch sphere: 

z(t) = cos[0(t)], y(t) = sin[0(i)], x(t) = 0. (7) 



The goal of the feedback is to produce 9(t) = VL t with a fixed frequency $l . We characterize the feedback efficiency 8 

by 



z? = cos0 crr (i), e CII = e-n t, (8) 

which is the time-averaged scalar product of the desired and actual state vectors on the Bloch sphere. An equivalent 
definition via the density matrix is D = 2 Tr(p dc siredPactuai) ~ 1- The qubit "phase shift error" 9 ey - r evolves as 8 

derr = - ^ sin 6 (^COSfl + fid) +ftfb(t), (9) 



Sid V 2 

where fifb(t) is the modulated part of the Rabi frequency due to feedback 

nR(t) = ft + nfb(t), ( 10 ) 

which is given by the "direct feedback" control law: 

<W'> F 4 M n 0<) ( /(t _ )-^±ZlV (11) 



fi AI 

Here F is the dimensionless feedback strength and the choice of the normalization factor 4/ AI corresponds to Ofb /flo = 
— Fsin^cn- on average. We use the notation I(t — 0) which reminds us about the unavoidable delay in the feedback 
loop and is assumed to be very small for this calculation. Moreoever, we have neglected effects of finite feedback 
bandwidth. 
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The qubit evolution (9) is written in the Stratonovich form; converting it into the Ito form (for averaging) we obtain 
the extra term [(A/) 2 /4S id ] sin 9 cos 9, which comes from the measurement part of Eq. (9) [the feedback (11) is the 
same in both forms, and there is no cross-term because of non-zero loop delay - see Ref. 4 and also Comment and 
Reply]. However, this extra term in the Ito form is not important because we average the evolution of the phase shift 
6* orr over the Rabi period. The averaging is simple when 6* err evolves slowly, so that 6* crr is uncorrelated with 9. Thus, 
we need to assume weak coupling, T -C fio and weak feedback, F <C 1. Averaging cancels the product sin 9 cos 9 and 
replaces sin(f2 i) cos 9 with — (sin# Grr )/2; thus we obtain 



^crr = (^^p sin(fi *) - sm(n t + 6 e „)j £id 



H — sm(Q. t) £add - Fn sm9 CII . (12) 

In this equation, the last term attracts the phase shift 9 CII to zero, while the noise terms cause diffusion of 9 elI . 
Examining the term in large parentheses, it is clear why there is an optimum value of the feedback strength F. 
For example, for an ideal detector (rj — 1, then £ add = 0), the effect of the noise £; d can be compensated when 
4Ftt /AI = AI/Sid, leading asymptotically to full synchronization, 9 eTT (t) = 0. This compensation was studied 
previously 9,10 in the context of qubit state stabilization about a fixed point on the Bloch sphere. 

Now let us average the noise in Eq. (12) over a Rabi period. We can replace sin(f2 £) £add with £ add /\/2, where £ add 
is also white noise with the same spectral density as £ add . Averaging the term with £; d is similar, but slightly more 
cumbersome. We first rewrite it as [Acos(fl t) + B sin(f2ot)]£i d with A = — (AI / S^) sin# err and B = 4Fil /AI — 
(AI/Sid) cos # crr . Averaging over a Rabi period then gives \J (A 2 + B 2 )/2 £; d with a similar white 

Let us now add the uncorrelated contributions from the noises £; d and £ add , and convert the result into a noise C £ ut> 
where £ out has the same spectral density 5 out as the output noise and C 2 = r](A 2 + B 2 )/2 + (1 — r])(AF£l / 1 AI) 2 /2. 
In this way we replace Eq. (12) with 

#crr = — FQ sin# orr + C£ out , Sf = S out , (13) 



.,o _ 2 /'<>,, I F I <>,, 



C 2 = ^ --— + ^-2cos0 crr . (14) 
This is a Langevin equation, and the corresponding Fokker-Planck equation for the probability distribution P(9 err , t) 

is 

8P d(TO sinfl crr P) 1 d 2 (C 2 S out P) 
dt d9 m + 4 d9 2 lr ' 

where P(9 cir ) is 2n periodic and is normalized as P(9 CII ) d9 cir = 1. The stationary solution P s t(#err) then satisfies 
equation d(C 2 S out P s t)/d9 CII + AFCIq sin 6 eTT P st = const = 0, where the constant is zero because of the symmetry 
between 9 crr and — 6 elI . Using the C 2 (9 cir ) dependence from Eq. (14), we get 

Pst(O m )= Po (^Y^ + I -^- 2co8d c") < ( 16 ) 

where the normalization constant p can also be calculated explicitly (not needed here) . 

Finally, from the stationary probability distribution for the phase shift 9 err , we calculate the feedback efficiency as 
D = cos 9 cir Pst^err) d9 err and thus obtain the analytical formula, 

D = i_f_ 2 fav 
v r/o + f 

which we have also confirmed by numerical simulations described later. From Eq. (17) it is straightforward to calculate 
the optimal value of the feedback factor F and corresponding maximum value for D: 

r 

F op t = VV <=T' ^mas = y/V- (18) 
"0 

Notice that F pt <C 1 for weak coupling, T <C Slo, so the assumption of weak feedback, F <C 1, is satisfied. Also, 
|^fb(i)l ^ if the filter bandwidth is well below S7§/r, which is satisfied in the experiment. 
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B. Numerical simulations 



We now discuss numerical simulations of the Bayesian equations. To avoid the complications with the Stratonovich 
vs the ltd form for the stochastic differential equations, we used a two step process to evolve equations (2) and (3). 
First, we set AI = i.e. we suppress the measurement and evolve the the resulting ordinary differential equations 
using a 4 th order Runge-Kutta step. We then include the effect of the measurement by performing a Bayesian 2 ' 6 
update which we describe below. The measurement output I m = r _1 J* +T I(t )dt in a given time interval r, is drawn 
from a Gaussian probability distribution with standard deviation a = \J Sid/{2r) and centered around Iq and I\ for 
the qubit in state |0) and |1) respectively. The conditional probability distributions are given by 



P(Im I |0» 



1 



=exp 



Urn - lo) 2 

2a 2 



. P(Im\\l)) 



1 



zCXp 



(Jm-Jl) 2 

2a 2 



(19) 



Given an initial qubit state p(t), the measurement outcome is drawn from a combined probability distribution 

P(Im) = Poo(t)P(I m | |0» + pu(t)P(I m | |f». (20) 

We use a combination of a binomial and a Gaussian random number generator to create a measurement outcome I m 
which is then used to update the qubit state using the following equations 2 



Pll (t)P(I m | |f)) 



Poo(t + r) 



Poo(t)P(I m I |0» 

P(Im) 



(21) 



Poi(* + t) = poi(*) 



y/pil(t + T)poo(t + T) 



(22) 



This process is repeated for each time step to obtain the qubit density matrix p(t) and the measurement output 
I(t) as a function of time. If ?7dct < 1, we add £ a dd(i) to I(t) which is generated using an appropriate Gaussian 
white noise generator. Since r onv is now included in equation (3), the spectral density for the added noise is given by 
Sadd = ^out — Sa, where S out — Sid/^det is the total output noise. In other words, the extra noise only corresponds 
to detector inefficiency. 

The output signal I(t) is low pass filtered with a 10 MHz cutoff to account for the bandwidth of the paramp. To 
create the feedback signal, we remove any dc offsets and multiply this output with the reference signal sin(Qoi)> where 
£!o/27T = 3 MHz. We then implement feedback by modifying Vln — > f2j? (t) in equations (2) and (3) using equations 
(10) and (11). Feedback loop delay (rdciay) is included by modifying the R.H.S of equation (11) so that t-¥t — Tdoiay 
while feedback circuit bandwidth is included by filtering fifb(t) with a 10 MHz low-pass filter before adding it to 
equation (10) . With the feedback modified qubit state p(t) we can compute feedback efficiency D as described in the 
previous section. 



C. Thermal fluctuations and higher qubit levels 



The discussion so far has assumed that the effective temperature of the qubit T" qu bit <C fi^oi/fcs where uj i is the 
transition frequency between the ground and first excited state. Even though the dilution fridge temperature T = 30 
mK, we find significant thermal population of the first excited state corresponding to an effective temperature of 
approximately 140 mK. We believe that this is due improper thermalization of the qubit sample inside the Aluminum 
cavity. Further, the transmon qubit has higher levels with similar transition frequencies between neighbouring levels 
(ui2 < woi) an d we observe few percent population in the second (and higher) excited states. 

We measure these populations using strong measurements which allows us to discriminate between the first 4 levels 
of the transmon with high single-shot fidelity 11-13 . Fig. S2 shows the single-shot histograms and one can clearly 
resolve four peaks in the distribution. These correspond to populations P = 0.83, Pi = 0.13, Pi = 0.03, P^ + = 0.01. 
Further, in the presence of Rabi driving (with or without feedback) the population in the 2nd excited state is enhanced 
up to 7.5 % and 2.5 % in higher levels. As mentioned in the main text, staet tomography of the stabilized state is 



6 



performed using single-shot strong measurements and hence we can remove data points where we find the qubit state 
in the second excited state or higher. This only affects the value of D by about 10%. For including these effects in 
the numerical simulations, we use a simple model described below. 

We consider only three levels (|0), |1), |2)) where |2) models all levels outside the |0)-|1) sub-space. Further, we only 
include the diagonal matrix element for |2) i.e. p 22 and all other terms involving the third level are set to zero. This 
is justified since we are only interested in the population of the higher levels and the off-diagonal terms representing 
the coherence are not important. Now the equations for the density matrix without the measurement terms (AI = 0) 
are given by 

Poo = O R Imp i + r i(/°n - Pn,st) (23) 

pu = -n R impoi - ri(pn - pn,st) + r 2 ( / o 2 2 - P22, s t) (24) 

Poi = i-^- (pn - Poo) - ^rPi2 (25) 

P22 = -r 2 (/322 - P22,st) (26) 

where in equation (25) we have replaced r cnv + Y\/2 in equation (3) with 1/T 2 * as explained in section III. Also pn )S t 
and p22,st are the steady state thermal populations in state |1) and |2) respectively and we use the experimentally 
determined values. We ignore any relaxation from |2) directly to |0) while the relaxation rate between |2) and |1) is 
modelled by r 2 . We set T 2 = 2Fi which is a good approximation for a weakly anharmonic qubit like the transmon. 

We now have a measurement current I 2 for the second excited state and we use I2 = 1 1 + AI which is a good 
approximation for a transmon in a cavity with \ "C k as is evident from Fig. S2. The Baycsian update for measurement 
now includes three levels. Equation (20) now becomes 

P(Im) = P00(t)P(I m I |0» + P u(t)P(Im I |1))+ P22(t)P(I m | |2» (27) 

where P(I m | |0)) and P(I m | |1)) are given by equation (21) and 



P(I m | |2» = ^^exp 



(J m - hf 
2a 2 



(28) 



is the conditional probability distribution for measurement of state |2). Measurement outcomes I m now use a combi- 
nation of a trinomial (to choose one of the levels) and a Gaussian random number generator. Equations 21 and 22 
are still valid for the Bayesian update but we have a new equation for p 22 given by 

MU^^ff. (29) 

As in the experiment, we run the numerical simulation for 80 ps and keep only those iterations where the system 
remains in the |0)-|1) sub-space sub-space. Fig. S3 shows a typical qubit trajectory generated from numerical 
simulations. A thermally excited quantum jump into the second excited state which lasts for about 10 ps is clearly 
visible. 
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FIG. SI: Dispersive shift and photon number calibration, a Ac stark shift Aw ac as a function of measurement drive 
power P. b Dephasing rate rR amscy = Y v + T2 as function P. Both quantities are extracted from Ramsey fringes by fitting 
an exponentially decaying sinusoidal function. The solid lines are linear fits with slopes m ac and m v respectively. The power 
range corresponds to n < 1. 




FIG. S2: Thermal population. The histograms are created from 3 x 10 single-shot measurements and cleary show four 
prominant peaks in the probability distribution corresponding to the first four levels of the transmon qubit. The blue trace 
corresponds to measurements made after letting the qubit relax to its steady state in the absence of any Rabi driving. The red 
trace corresponds to steady state with Rabi driving on the 0-1 transition and shows increased population in the second and 
higher excited states. 
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FIG. S3: Simulated qubit trajectory. The figure shows a typical qubit trajectory generated using numerical simulations 
and plots poo (blue) , pn (red) and P22 (green). At the beginning, the qubit is undergoing stabilized Rabi oscillations which 
is interrupted due to a thermally excited quantum jump into the second excited state. The system remains in that state for 
about 10 ps before falling back into the 0-1 sub-space and the Rabi oscillations resume. 



